SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems

نویسندگان

چکیده

Zero/few-shot transfer to unseen services is a critical challenge in task-oriented dialogue research. The Schema-Guided Dialogue (SGD) dataset introduced paradigm for enabling models support any service zero-shot through schemas, which describe APIs natural language. We explore the robustness of systems linguistic variations schemas by designing SGD-X - benchmark extending SGD with semantically similar yet stylistically diverse variants every schema. observe that two top state tracking fail generalize well across schema variants, measured joint goal accuracy and novel metric measuring sensitivity. Additionally, we present simple model-agnostic data augmentation method improve robustness.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust grammatical analysis for spoken dialogue systems

We argue that grammatical analysis is a viable alternative to concept spotting for processing spoken input in a practical spoken dialogue system. We discuss the structure of the grammar, and a model for robust parsing which combines linguistic sources of information and statistical sources of information. We discuss test results suggesting that grammatical processing allows fast and accurate pr...

متن کامل

Robust interpretation for spoken dialogue systems

Spoken dialogue systems must allow for robust and efcient interpretation of user utterances. This can be achieved by using shallow and partial interpretation. Partial interpretation is feasible together with a dialogue manager which provides information to guide the analysis. In this paper we present results on developing interfaces for information retrieval applications utilizing partial and i...

متن کامل

Star Schema Benchmark (ssb)

Big Data Analytics Benchmark (BigBench). Tags: pdgf Tags: star schema benchmark, ssb, parallel data generation framework, pdgf, benchmarking, skew. relational models which have been for a few years the most used to support classical data warehousing applications such as Star Schema Benchmark (SSB). Star. Schema Benchmark (6) is recently proposed datawarehousing benchmark that has been implement...

متن کامل

A Robust Control Design Technique for Discrete-Time Systems

A robust state feedback design subject to placement of the closed loop eigenvalues in a prescribed region of unit circle is presented. Quantitative measures of robustness and disturbance rejection are investigated. A stochastic optimization algorithm is used to effect trade-off between the free design parameters and to accomplish all the design criteria. A numerical example is given to illustra...

متن کامل

Robust parsing in spoken dialogue systems

The rule-based parsing is a prevalent method for the natural language understanding (NLU) and has been introduced in dialogue systems for spoken language processing (SLP). However, additional measures must be taken to cope with the severe spoken linguistic phenomena, such as garbage, repetition, ellipsis, word disordering, fragment and ill form, which frequently occur in the spoken language. We...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2022

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v36i10.21341